Multiple Imputation of Predictor Variables Using Generalized Additive Models

نویسندگان

  • Roel de Jong
  • Stef van Buuren
  • Martin Spiess
چکیده

The sensitivity of multiple imputation methods to deviations from their distributional assumptions is investigated using simulations, where the parameters of scientific interest are the coefficients of a linear regression model, and values in predictor variables are missing at random. The performance of a newly proposed imputation method based on generalized additive models for location, scale and shape (GAMLSS) is investigated. Although imputation methods based on predictive mean matching are virtually unbiased, they suffer from mild to moderate under-coverage, even in the experiment where all variables are jointly normal distributed. The GAMLSS method features better coverage than currently available methods.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Fitting Generalized Additive Models with the GAM Procedure in SAS 9 . 2

Generalized additive models are useful in finding predictor-response relationships in many kinds of data without using a specific model. They combine the ability to explore many nonparametric relationships simultaneously with the distributional flexibility of generalized linear models. The approach often brings to light nonlinear dependency structures in your data. This paper discusses an examp...

متن کامل

FWDselect: An R Package for Variable Selection in Regression Models

In multiple regression models, when there are a large number (p) of explanatory variables which may or may not be relevant for predicting the response, it is useful to be able to reduce the model. To this end, it is necessary to determine the best subset of q (q ≤ p) predictors which will establish the model with the best prediction capacity. FWDselect package introduces a new forward stepwiseb...

متن کامل

A case study on using generalized additive models to fit credit rating scores

We consider the estimation of credit scores by means of semiparametric logit models. In credit scoring, the fitted rating score shall not only provide an optimal classification result but serves also as a modular component of a (typically quite complex) rating system. This means in particular that a rating score should be given by a linearly weighted sum of rating factors. That way the rating p...

متن کامل

Approximately generalized additive functions in several variables

The goal of  this paper is to investigate the solutionand stability in random normed spaces, in non--Archimedean spacesand also in $p$--Banach spaces and finally the stability using thealternative fixed point of generalized additive functions inseveral variables.

متن کامل

Comparing Different Modeling Techniques for Predicting Presence-absence of Some Dominant Plant Species in Mountain Rangelands, Mazandaran Province

In applied studies, the investigation of the relationship between a plant species and environmental variables is essential to manage ecological problems and rangeland ecosystems. This research was conducted in summer 2016. The aim of this study was to compare the predictive power of a number of Species Distribution Models (SDMs) and to evaluate the importance of a range of environmental variabl...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:
  • Communications in Statistics - Simulation and Computation

دوره 45  شماره 

صفحات  -

تاریخ انتشار 2016